arXiv:1505.00970vl [astro-ph.IM] 5 May 2015 


Astron. Nachr. /AN No. 1-16 (2015)/DOI please set DOll 


The Gaia spectrophotometric standard stars survey — II. 

Instrumental effects of six ground-based observing campaigns.* 

G. Altavilla 1 ’**, S. Marinoni 2 3 , E. Pancino 1 3 , S. Galleti 1 , S. Ragaini 1 , M. Bellazzini 1 , G. Cocozza 1 , 
A. Bragaglia 1 , J. M. Carrasco 4 , A. Castro 5 , L. Di Fabrizio 6 , L. Federici 1 , F. Figueras 4 , M. Gebran 7 , 
C. Jordi 4 , E. Masana 4 , W. Schuster 5 , G. Valentini 8 , and H. Voss 4 

1 INAF-Osservatorio Astronomico di Bologna, Via Ranzani 1,1-40127 Bologna, Italy 

2 INAF-Osservatorio Astronomico di Roma, Via Frascati 33,1-00040, Monte Porzio Catone (Roma), Italy 

3 ASI Science Data Center, via del Politecnico SNC, 1-00133 Roma, Italy 

4 Departament d’Astronomia i Meteorologia, Institut de Ciences del Cosmos (ICC), Universitat de Barcelona (IEEC-UB), 
Marti i Franques, 1, 08028 Barcelona, Spain 

5 Observatorio Astronomico Nacional, Universidad Nacional Autonoma de Mexico, Apartado Postal 106, C. P. 22800 
Ensenada, BC, Mexico 

6 Fundacion Galileo Galilei - INAF, Rarnbla Jose Ana Fernandez Perez, 7, 38712 Brena Baja, Tenerife, Spain 

1 Department of Physics and Astronomy, Notre Dame University Louaize, PO Box 72, Zouk Mikael, Zouk Mosbeh, 
Lebanon 

8 INAF-Osservatorio Astronomico di Teramo, Via Mentore Maggini snc, Loc. Collurania, 64100 Teramo, Italy 

Received 2015, accepted 2015 
Published online later 

Key words instrumentation: detectors - methods: data analysis - techniques: photometric - techniques: spectroscopic - 
telescopes 

The Gaia SpectroPhotometric Standard Stars (SPSS) survey started in 2006, it was awarded almost 450 observing nights, 
and accumulated almost 100 000 raw data frames, with both photometric and spectroscopic observations. Such large ob¬ 
servational effort requires careful, homogeneous, and automated data reduction and quality control procedures. In this 
paper, we quantitatively evaluate instrumental effects that might have a signihcant (i.e., >1%) impact on the Gaia SPSS 
flux calibration. The measurements involve six different instruments, monitored over the eight years of observations ded¬ 
icated to the Gaia flux standards campaigns: DOLORES@TNG in La Palma, EFOSC2@NTT and ROSS@REM in La 
Silla, CAFOS@2.2m in Calar Alto. BFOSC@Cassini in Loiano, and LaRuca@1.5m in San Pedro Martir. We examine 
and quantitatively evaluate the following effects: CCD linearity and shutter times, calibration frames stability, lamp flex¬ 
ures, second order contamination, light polarization, and fringing. We present methods to correct for the relevant effects, 
which can be applied to a wide range of observational projects at similar instruments. 
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1 Introduction 

GaitQ is a cornerstone mission of the ESA (European Space 
Agency) Space Program, launched in 2013 December 19. 
The Gaia satellite is performing an all-sky survey to obtain 
parallaxes and proper motions to //as precision for about 10 9 
objects down to a limiting magnitude of V ~ 20 mag, and 
astrophysical parameters (T e g, log g, E(B — V), metallicity, 
etc.) plus 2-30 km s _1 precision — depending on spectral 
type — radial velocities for several millions of stars with 
V < 16 mag. Such an observational effort will have an im¬ 
pact on all branches of astronomy and astrophysics, from 

* Based on data obtained with: BFOSC@Cassini in Loiano, Italy; 
EFOSC2@NTT in La Silla, Chile; DOLORES@TNG in La Palma, Spain; 
CAFOS@2.2m in Calar Alto, Spain; LaRuca@1.5m in San Pedro Martir, 
Mexico (see acknowledgements for more details). 

** Corresponding author: e-mail: giuseppe.altavilla@oabo.inaf.it 

1 http://www.cosmos.esa.int/web/gaia/home 


solar system objects to distant QSOs (Quasi Stellar Objects, 
or Quasars), from the Galaxy to fundamental physics. The 
exquisite quality of Gaia data will allow a detailed recon¬ 
struction of the 6D spatial structure and velocity field of 
the Milky Way galaxy within ~ 10 kpc from the Sun, pro¬ 
viding answers to long-standing questions about the origin 
and evolution of our Galaxy, from a quantitative census of 
its stellar populations, to a detailed characterization of its 
substructures, to the distribution of dark matter. Gaia will 
also determine direct geometric distances to many kinds of 
distance standard candles, setting the cosmological distance 
scale on extremely firm bases. The Gaia scanning law will 
cover the whole sky repeatedly (~70 t imes o n average, over 


the planned 5-years G aia lifetime, see ICarrasco et al.ll2007 


Lindegren et ah 2008 ). therefore, a large number of tran¬ 


sient alerts will be a natural byproduct (for example, many 
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SNe will be discovered, see Altavilla et al. 201 2| 1 For 


review of Gaia science cases, see Perryman et al 


( 2001 ) 


Mignard (20051, and Prustil ( 2011 ). The challenges in the 
data processing of this huge data volume require a team 
of hundreds of scientists and engineers (organized in the 
DPAC, or Data Analysis and Processing Consortium, see 
Mignard et al.ll2008b . 

Gaia is usually described as a self-calibrating mission, 
but it also needs external d ata to fix the zero-point of the 
magnitu de/flux syste m (Pan cino et al.ll2012l) and radial ve¬ 
locities ( Soubiran et al.lCOl 3l) , an d totrain the classificatio n 
and parametrization algorithms (IBailer-Jones et al.ll2013l) . 
Gaia photometry will come from the astrometric array of 
CCDs in the Gaia G-band, a filter-less band whose profile is 
defined by the reflectivity of the mirrors and by the sen¬ 
sitivity of the detectors, and from the blue (BP) and red 
(RP) spectro-photometers which will produce dispersed i- 
mages with 20< A/AA <100 over the spectral ran ges 3 30- 
680 nm and 640-10 50 nm, respectively (see also Jordi et afl 
2010l : Prusti 2012h . The derived spectro-photometry is es¬ 
sential not only to properly classify stars or estimate in¬ 
terstellar extinction, but also to compute the chromaticity 
correction to centroids of stars, a fundamental piece of in- 
formation to achieve the planned astrometri c performance 
( Busonero et al.ll2006t Lindegren et al.112008 ). 

The final conversion of internally-calibrated G instru¬ 
mental magnitudes and BP/RP instrumental fluxes into 
physical units requires an external ab solute flux scale, tha t 
our team is in charge of providing (IPancino et al.l l2012h . 
The ideal Gaia spectrophotometric standard stars (SPSS) 
grid should comprise of the order of 100 SPSS in the range 
9 < V < 15 mag, properly distributed in the sky to be 
observed by Gaia as many times as possible. The mission 
requirement is to calibrate Gaia data with a n accu racy of a 
few percent (1-3%) with respect to Vega ( Bohlinl2007 ). We 
use as Pillars the three pure Hydrogen WDs (White Dwarfs) 
adopted by Bohlin. Colina & Finlevl ( 1995 ) as fundamental 
calibrators. Because the Pillars are not always visible, we 
are also building an intermediate grid of calibrators ( Pri¬ 
mary SPSS) that are observable all year round, with 2—4 m 
class telescopes. 

A set of more than 100 flux tables with <1% inter¬ 
nal consistency and <1-3% absolute calibration with re¬ 
spect to Vega_ are of interest for many scientific appli¬ 
cations (see Bohlin. Gordon. & Tremblay 20141 for a re¬ 
view of spectrophotometric methods and catalogues), in¬ 
cluding dark e nergy surveys based on type la supernovae 
( Sullivan et al] 201lh . Our observing campaign to build a 
grid of SPSS, for the flux calibration of Ga_ia da ta, is a mong 
the largest of this kind to date (IPancino et al. 2012p. Ob¬ 
servations started at the end of 2006, on six different tele¬ 
scopes and instruments, and will presumably end in 2015. 
Almost 100 000 raw frames were collected and we faced the 
challenge of analyzing these data as uniformely, automat- 


2 Indeed, Gaia started already discovering SNe, as testified by the ATel 
(http://www.astronomerstelegram.org /1 telegrams published so far. 



Fig. 1 An example of non-uniform CCD illumination 
caused by a too short t eX p, for an iris-type shutter, obtained 
with LaRuca@1.5m in San Pedro Martir, Mexico. The fig¬ 
ure shows the ratio between a flat field obtained with a rel¬ 
atively long exposure time (20 sec) and a very short one 
(0.1 sec): ADU (Analogue-to-Digital Unit) variations of 
about ±5% are observed from the center to the corners of 
the image. 


edly, and carefully as possible. To ensure that the maximum 
quality could be obta ined from SPSS observations, a s et of 
careful observations ( Pancino et alJl2008l 20091 20 111) and 
data reduction dCocozza et alJl2013t iMarinoni et al.l l2012h 


protocols was implemented, following an initial assessment 


^Altavilla et al. 2011 . 2014 : MarinonfcOl 1 : Marinoni et al., 

20131) of all the instrumental effects that can have an impact 
on the flux calibration precision and accuracy. Our final goal 
was to make sure that all the instrumental effects that could 
affect the quality of the final SPSS flux tables were under 
control, and corrected with residuals below 1%. 


This paper presents the methods and results of such in¬ 
strumental effects study, and is organized as follows: Sec- 
tion[2]presents the shutter characterization of the employed 
CCD cameras; Section [3 presents the CCD linearity stud¬ 
ies; Section 0] presents the methods and results of the cal¬ 
ibration frames monitoring campaign; Section [5] presents a 
study of lamp flexures with DOLORES at the TNG (Tele- 
scopio Nazionale Galileo); Section[6]presents a study of the 
effect of polarization on the accuracy of flux calibrations; 
Section [7] presents a method to correct low resolution spec¬ 
tra for second-order contamination effects; and finally Sec- 
tion[8]presents our summary and conclusions 
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2 Shutter characterization 

To obtain accurate and precise photometry in our survey, we 
need to characterize two shutter quantities: the shutter delay 
and the minimum exposure time. 

The shutter delay time, also called shutter dead time 
or shutter offset, is the difference between the requested 
and the effective t exp (exposure time). If the shutter closes 
slower than it opens, the exposure time will be longer than 
requested, and the shutter delay might be positive, otherwise 
it is negative. Knowledge of the shutter delay time is neces¬ 
sary for high-precision photometry, where the effective t eX p 
is needed. 

The minimum acceptable t exp is set by the finite time 
the shutter takes to travel from fully closed to fully open 
(and vice versa). It is not strictly related to the shutter delay. 
A shutter can have a very small shutter delay (for example 
because it opens and closes with very similar delays), but it 
might take some time to completely open. A different shut¬ 
ter can have a significant shutter delay (for example because 
it starts closing with a significant delay), but it might have a 
negligible minimum acceptable time because once it starts 
opening (or closing) it completes the operation in a very 
short time. Regardless of the shutter delay, a shutter that 
takes a long time to fully open (or close) might introduce 
significant] illumination non-uniformities across the CCD. 
To avoid illumination non-uniformities, t exp must be longer 
than a safe minimum (see Figure Q]for an example). 

2.1 Observations and data reductions 

To measure shutter effects, series of imaging flat fields with 
different exposure times were obtained at each telescope 
(see Table []}■ Each series consisted of triplets of flats with 
increasing exposure time, from very short to very long (thus 
useful for measuring CCD linearity as well, see Section^. 
After each triplet, a monitoring triplet with constant expo¬ 
sure time was also taken, to monitor the lamp intensity con- 
stanc>0, following the so-called bracketed repeat exposure 
method (see Section Ofor more details). 

All flat field images were processed with IRAfjf] by cor¬ 
recting them for overscan (where applicable) and subtract¬ 
ing a master bias. For all analyzed instruments the dark cur¬ 
rents turned out to be negligible (see Section|4]i and no cor¬ 
rection for dark was applied. The median of images belong- 

J Owing to our re quirements for the flux calibration of Gaia data 
Ivan Leeuwen & Richard] I2012I) . we cannot accept illumination non¬ 
uniformities above 1 % approximately. 

4 The most common cause of lamp instability is thermal drift, causing 
the intensity to increase for a given exposure time as the lamp gets warmer. 

5 IRAF is the Image Reduction and Analysis Facility, a general pur¬ 
pose software system for the reduction and analysis of astronomical data. 
IRAF is written and supported by the IRAF programming group at the 
National Optical Astronomy Observatories (NOAO) in Tucson, Arizona. 
NOAO is operated by the Association of Universities for Research in As¬ 
tronomy (AURA), Inc. under cooperative agreement with the National Sci¬ 
ence Foundation. 



Fig. 2 Example of lamp drift correction, based on data 
obtained with LaRuca@1.5m in August 2008 (see TableO}. 
Median ADU per second of each triplet are plotted as a 
function of time at triplet start. Red symbols represent mon¬ 
itoring triplets, while black symbols represent the actual se¬ 
ries triplets. Empty symbols refer to uncorrected triplets, 
filled symbols to triplets corrected for lamp drift (see Sec¬ 
tion ED - After correction, monitoring triplets are aligned, 
while data triplets present some residual scatter (less than 
1%), presumably caused by residual lamp instability after 
correction. A non-linear behaviour at low exposure times 
(left side of the plot) is clearly visible. 

ing to the same triplet was computed to remove cosmic ray 
hits and to reduce the noise. 

To correct for lamp drifts, each median image resulting 
from a monitoring triplet was divided by a reference moni¬ 
toring triplet (generally the first one of the same sequence), 
to derive a correction factor. The correction factor was used 
to correct each triplet — both monitoring and data triplets 
— and to report them to an ideal ADU level, free of lamp 
drift variations. The results of this correction are illustrated 
in Figure [2] The same type of data sequences, extending to 
longer t exp , can be used for the CCD linearity characteriza¬ 
tion described in Section 0 where the lamp drift correction 
is performed in the same way. 

2.2 Shutter delay 

We used two different methods to measure the shutter de¬ 
lay. In the first method, a linear extrapolation of the ADU 
(Analogue-to-Digital Units) versus t exp relation crosses the 
time axis at a value that is generally different from zero, and 
that corresponds to the shutter delay (Figure 0 top panel). 
In the second method, the count rate (ADU per second) are 
plotted versus ADU and should ideally remain constant; in 
practice, a deviation occurs at low t exp . By iteratively ad¬ 
justing the exposure time to t + St, the shutter delay cor¬ 
responds to the dt that minimizes the residuals from a flat 
count rate, at the low t exp end (see also Figure [3] bottom 
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Table 1 Flat field sequences used for the shutter delay, minimum acceptable exposure time, and linearity measurements. 
The columns contain (see text for more information): (1) the used instrument@telescope combination; (2) the sequence of 
exposure times used; (3) the exposure time of the monitoring exposures, when available; (4) the used filter or grism; (5) the 
used slit width in arcseconds, when applicable; (6) the observation date; (7) the experiment for which the measurements 
were used, with ‘a ’ indicating the shutter delay measurement, ‘b ’ the minimum acceptable exposure time measurement, 
and ‘c’ the linearity measurement. Of type a and c observations, the ones taken on 2009 January 29 lack monitoring triplets, 
because the data were downloaded from the TNG archive. 

Instillment 

texp 

(sec) 

t m 

l exp 

(sec) 

Filter/grism 

Slit 

(”) 

Date 

(yyyy Month dd) 

Notes 

BFOSC@Cassini 

0.5, 1, 2, 3, 4, 7, 10, 20, 30, 40, 50, 55, 60, 65 

5 

B 

- 

2009 May 27 

a,b 


6, 26 

10 

gr7 

2 

2010 Aug. 30 

c 

EFOSC2@NTT 

10, 60, 120 

40 

gr3 

2 

2008 Nov. 26 

c 


3, 15,30 

10 

gr3 

2 

2008 Nov. 27 

c 


0.1, 0.2, 0.5, 1,2, 3, 6, 15, 20 

10 

B 

- 

2008 Nov. 28 

a,b 

DOLORES @ TNG 

5, 20, 55 

25 

VHR-V 

2 

2008 Jan. 17 

c 


1, 2, 4, 5, 6, 7, 8, 9 

- 

B 

- 

2008 Jan. 29 

a,b 

CAFOS@2.2m 

1. 5, 10, 20, 25, 30 

15 

B200 

2 

2007 Apr. 01 

c 


0.1, 0.5, 1,2, 3, 4, 5, 6, 20, 30, 40 

- 

R 

- 

2008 Apr. 17 

b 


0.1, 0.2, 0.5, 1, 2, 3, 4, 5, 6, 15, 20, 30 

- 

R 

- 

2008 Sep. 09 

b 


0.1, 0.2, 0.4, 0.6, 0.8, 1,4, 6, 8,8.7 

2 

V 

- 

2010 Sep. 23 

a,b,c 

LaRuca@ 1.5m 

0.1, 0.2, 0.4, 0.6, 0.8, 1, 2, 3, 4, 10, 20 

- 

White 

- 

2008 Aug. 20 

b 


1, 2, 4, 8, 15, 18, 20, 25, 30, 35, 36, 37 

10 

B 

- 

2008 Aug. 23 

a,c 


2, 4, 7, 14, 28, 40, 56, 62, 68, 74, 80, 87, 89 

20 

R 

- 

2010 Jul. 17 

a,c 


0.2, 0.4, 0.6, 0.8, 1, 2, 3, 4, 5, 6, 10, 15, 20, 30 

- 

R 

- 

2010 Jul. 17 

b 


p 

a 

<4000 


U2000 


0.2 0.4 


0.6 


0.8 



Fig. 3 Example of shutter delay determination, for Calar 
Alto observations obtained on 2010 September 23. Top 
panel: a zoom into the low t eX p region of the classical ADU 
versus t exp plot; the dotted line represents a linear fit of all 
data and it does not intersect the X axis at zero, but at - 
0.023 sec. Bottom panel: residuals of the count rate (ADU 
s “ 1 ) from their average value, as a function of ADU; empty 
dots are the measured values, while filled dots are corrected 
for a shutter delay of -0.011 sec. The average of the two 
determinations, -0.017 sec, is reported in Table [2] 


panel). We applied both methods to the data in Table [§ 
We obtained consistent values and thus we averaged the re¬ 
sults from the two methods to obtain a final shutter dela\0. 
Results are reported in Table [2] 


2.3 Minimum acceptable exposure time 

The effect of illumination variations was estimated by divid¬ 
ing each of the flat fields in a series (Table[T|i by the longest 
non-saturated flat field in the same series. The resulting ratio 
images (an example is shown in Figure [7} should have os¬ 
cillations or variations which are caused by noise only, and 
no systematic large scale patterns with amplitudes higher 
than roughly 1%, as required by the Gaia mission calibra¬ 
tion goals. The minimum acceptable exposure times, t™‘ p , 
obtained with this criterion are listed in Table [2] While the 
shutter delay is generally much lower than one second, t™™ 
can be higher than that. We stress the fact that if flat fields 
acquired with too short t exp are used to correct all frames of 
one night, they will affect also scientific images taken with 
relatively long t exp , and should thus be avoided. 


6 We note that the BFOSC shutter was changed in 2010 February, with 
a much faster one providing a negligible shutter delay, but vignetting with 
t eX p lower than 5 sec (R. Gualandi, 2010, private communication). Simi¬ 
larly, the old diaphragm shutter of CAFOS@2.2m was replaced by a more 
efficient 2-blades shutter during Summer 2008 (Santos Pedraz, 2010, pri¬ 
vate communication). 

7 In practice, we applied the lineality correction (see Section [3j to the 
shutter data and the shutter delay correction to the linearity data, before 
re-computing both the linearity fit and the shutter delay. As expected, such 
corrections had a negligible impact on the resulting values. 
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Table 2 Resulting shutter delays and minimum expo¬ 
sure times for the relevant flat field series in Table Q] The 
columns contain (see text for more information): (1) the 
used instrument@telescope combination; (2), the shutter 
delay; and (3) the minimum acceptable exposure time. 


Instrument 

cit 

(sec) 

i.min 

*-exp 

(sec) 

BFOSC@Cassini“ 

-0.300±0.050 

~5 

BFOSC@Cassini 6 

negligible 

~5 

EFOSC2@NTT 

+0.008±0.001 

<0.1 (if any) 

DOLORES@TNG 

-0.011±0.002 

<1 (if any) 

CAFOS@2.2m“ 

(no data) 

3 

CAFOS@2.2m d 

-0.017±0.070 

<0.5 

LaRuca@1.5m 

+0.028±0.004 

5 


“Valid before 2010 February (see text). 
b Valid after 2010 February (see text). 
“Valid before Summer 2008 (see text). 
d Valid after Summer 2008 (see text). 


3 CCD linearity 


Linearity is a measure of how consistently the CCD re¬ 
sponds to different light intensity over its dynamic range. 
CCDs can exhibit non-linea rities, typic all y at e ither or both 
low and high signal levels ( Diorgovski & Dickinson! 1989t 
Walkerll993l) . High quality CCDs show significant response 


deviations (> 1%) from linearity only close to saturation, 
i.e., at high signal levels when the potential well depth is 
almost full. Of course the CCD response strongly deviates 
at saturation, when the potential well depth is full and addi¬ 
tional incoming photons do not increase the photoelectrons 
in a given pixel. When observations are restricted to the lin¬ 
ear portion of the dynamic range, the CCD performs as a 
detector suitable for accurate spectrophotometric measure¬ 
ments. 


A few different methods have been presented in the lit- 



serving campaigns, we tested the CCD linearity with two 
methods, described below. 


The classical method is based on a series of imaging 
flat fields with increasing t exp . Usually, at very high count 
levels the 1:1 relationship between ADU and t eX p breaks, 
and an increment of t exp does not correspond anymore to 
a fixed increase in ADU (an example can be found in Fig¬ 
ure 0). We used the relevant data listed in Table [T| reduced 
and corrected for lamp drift as described in Section [2~i~l We 
considered that a deviation of >1% from linearity was not 
acceptable given our requirements, and thus we report in 
Table |3]the ADU level at which such deviations occur. 



Counts[ADU] 

Fig. 4 Example of linearity test for Calar Alto observa¬ 
tions obtained on 2010 September 23, with the SITE#ld_15 
CCD. Top panel: a zoom into the high ADUs region of the 
classical ADU vs. t exp plot; the dotted line represents a lin¬ 
ear fit of all data. Bottom panel: residuals of the count rate 
(ADU _1 ) from their average value, as a function of counts; 
the dotted line represents a constant count rate. In both pan¬ 
els, points deviating by more than 1% are represented by 
empty circles and coloured in red. 


The second method we tested is described by 
Stello et ah ( 20061) . and is a variation on the literature meth¬ 
ods described above. It requires the acquisition of at least 
two spectroscopic flat fields, one reaching the saturation 
level and the other covering a fainter intensity range. The 
choice of slit width and grism (or grating) is not crucial, 
but it is important to obtain flat fields with a wide — and 
preferably monotonic increasing — intensity range along 
one direction (either a CCD column or line). The frames 
were collapsed along the dispersion direction, and corrected 
for lamp drifts as explained in Section 12.11 but taking into 
account that lamp drifts in spectra can show also colour vari¬ 
ations. Afterwards, w e ap plied the method as described in 
the original paper by Stello et al. ( 20061) - by computing the 
intensity ratio of exposure pairs, normalized by their respec¬ 
tive t exp , which is called gain-ratio curve. This ratio is the 
starting point of an iterative procedure of inversion to deter¬ 
mine the gain at different intensity levels. 

In conclusion, given our simple purpose of estimating a 
safe ADU limit for our observations, the cla ssic al method 
and the high-precision method ( Stello et~ai1l2006l) give sub¬ 
stantially the same results. While the required spe ctroscopic 
observations are much faster to obtain, the Stello et al. 


( 2006 ) method is highly sensitive to different choices for 
the fit of the gain-ratio curves (function, order, and rejection 
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Table 3 Resulting maximum acceptable ADU levels obtained from the flat series in Table [j] The columns contain (see 
text for more information): (1) the used instrument@telescope combination; (2) the CCD used; (3) the validity period of 
each CCD; (4) the max imum accep table ADU level obtained with the classical method; (5) the maximum acceptable ADU 
level obtained with the Stello et al. ( 2006 ) method, when applicable; and (6) the final adopted value for each CCD: when 
there were no sufficient data, the safe value was assumed conservatively on the basis of that obtained for similar CCDs; in 
those cases, the values are indicated in parentheses. 


Instrument 

CCD 

Validity period 

A TAT r( (Jlassic ) 

U max 

Ar)TT^ stell °^ 

u max 

a rtT T( Ado P ted ) 

l\LJ u max 

BFOSC@Cassini 

EEV 1300x 1340B (old) 

Before 2008 Jul. 17 



(50000) 


EEV 1330x 1340B (new) 

After 2008 Jul. 17 


60 000 

60000 

EFOSC2@NTT (lxl bin) 

CCD#40 LORAL/LESSER 

All runs (from 2007) 


49 000 

49 000 

EFOSC2@NTT (2x2 bin) 


All runs (from 2007) 


47 000 

47 000 

DOLORES@TNG 

E2V 4240 (Marconi) 

All runs (from 2007) 

60000 

60 000 

60000 

CAFOS@2.2m 

SITE# Id. 15 

All runs (from 2007) 

55 000 

53 000 

53 000 

LaRuca@ 1.5m 

SITE#1 

Until 2009 Jul. 

60000 


60000 


ESOPO 

2009 Oct. 20-22 



(60000) 


E2V 4240 (Marconi) #1 

2009 Oct. - 2010 Dec. 

60000 


60000 


E2V 4240 (Marconi) #2 

2011 Mar. 9-17 



(57 000) 


SITE#4 

2011 May 2-11 



(60000) 


Table 4 List of grisms used for the SPSS campaigns with the four spectrographs. The columns contain: (i) the instru- 
ment@telescope combination; (ii) the grism name; (iii) the minimum wavelength; (iv) the maximum wavelength; (v) the 
approximate wavelength at which second order contamination starts (if any, see text); (vi) the approximate wavelength at 
which fringing starts (if any, see text); and ( vii) the approximate relative amplitude of the fringing pattern, as measured on 
our spectra before correcting for fringing. All figures were derived from our SPSS spectra. 


Instrument 

Grism 

-^min 

(A) 

Amax 

(A) 

x {start) 
/v 2ndorder 

(A) 

x {start) 
''fringing 

(A) 

max fringing 
(±%) 

BFOSC@Cassini 

#3 

3300 

6420 

— 

— 

— 


#5 

4800 

9800 

— 

~7000 

10-15 

CAFOS@2.2m 

B200 

3200 

9000 

— 

— 

— 


R200 

6300 

11000 

>12000 

~8500 

<5 

EFOSC2@NTT 

#5 

5200 

9350 

— 

~7300 

£10 


#11 

3380 

7520 

— 

— 

— 


#16 

6015 

10320 

~6000 

~7300 

£10 

DOLORES@TNG 

LR-B 

3000 

8430 

~6000 

— 

— 


LR-R 

4470 

10073 

~9500 

~8000 

5-15 


method), and to their extrapolation at zero counts. There¬ 
fore, the method is not as straightforward to apply as the 
classical method. To estimate the ADU level at which our 
data deviate more than 1% from linearity, we conservatively 
chose the lowest of the two estimates, in the few cases where 
the two methods provided different results. Results for the 
employed instruments can be found in Table [3] 

4 Calibration frames 

The quality and stability of the calibration frames was tested 
on all the acquired calibration data, producing recommen¬ 
dations for the calibration plans that were progressively re¬ 
fined as more and more data were available for the analy¬ 
sis. We found a surprising variety of behaviours among dif¬ 
ferent instruments and observing sites. Considering that we 
acquired of the order of 100000 frames in our campaigns 


20121 for the SPSS campaigns descrip¬ 
tion), it was important to implement automated procedures 
to identify those calibration frames that needed a closer in¬ 
spection, before validating the corresponding reduced data. 
Whenever possible, we acquired daily or nightly calibration 
frames, but in case of problems, we used the derived cali¬ 
bration plan presented in Table [5] to decide how to analyze 
the data (i.e., which of the archival or available frames were 
acceptable) and how to judge the quality of the applied re¬ 
ductions. In the following sections we briefly describe the 
monitoring strategy and the main conclusions on each in¬ 
strument and telescope. 


(see Pancino et al 


4.1 CCD cosmetics 

We employed BPMs (Bad Pixel Masks), i.e., images of the 
same dimension of the CCD frames, where good pixels are 
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Fig. 5 Final gain curves obtained with the IStello et al 


(120061) method for NTT, using data obtained on the 26 and 
2008 November 27, with lxl and 2x2 binning, respec¬ 
tively, with the ESO grism #3. The dashed line is for the 
2x2 binning while the solid line is for the lxl binning. 
Very small deviations from linearity start to appear already 
around 5000 ADU, while deviations above 1% kick in at 
~47 500 ADU for the 2x2 binning and at 49000 ADU for 
the 1 x 1 binning. 


Fig. 6 Example of fringing correction (see text) on a spec¬ 
trum of BD+284211, observed in Loiano on 2013 Octo¬ 
ber 28, with gr5 and a 2.5” slit. The blue solid line is the 
spectrum after fringing correction, based on a day-time flat, 
while the red dotted spectrum is before the correction. Beat¬ 
ings are observed, caused by small differences in the fring¬ 
ing pattern of the spectrum and the flat used for the correc¬ 
tion. For example, aroud pixel 900 the residuals are below 
5%, while around pixel 1000 they remain higher, around 
15%. 


identified by zero values and bad pixels by non-zero values. 
BPMs were created once or twice a year, using the ratio of 
two flat fields with different count levels — where pixels 
responding non-linearly stand clearly out — by means of 
the IRAF ccdmask task. They were applied with the IRAF 
taskfixpix, which replaces bad pixels with estimates of their 
correct value. The estimates are built using a linear interpo¬ 
lation across the nearest non-bad pixels, either across lines 
or columns, or both, depending on the local characteristics 
of the BPM (see task description for more details), i.e., dif¬ 
ferently for bad lines, bad columns, or clusters of bad pixels. 


4.2 Fringing 


Fringing, caused by multiple reflections and interferencesof 
red or infrar ed light in thin CCD substrates (IHowellll2012 ; 
LesseillT990l) . was not relevant for the imaging filters used 


in our observations (mostly B, V, and rE but was present 
and relevant in all our spectra, because we needed to cover 
the whole Gaia wavelength range, from 330 to 1050 nm. All 


8 As in the above cases, fringing is considered to be relevant when the 
fringing pattern in images or spectra has an amplitude larger than 1% of 
the signal. 

9 The procedure for correcting fringing in images i nvolves the con¬ 
struction of a super-flat and is well described elsewhere iGullixsonlll992t 
lNewberrvlll99lllTvsonll990l) . 


the instruments used for spectroscopy do suffer from fring¬ 
ing (see Table [4]), well visible in our spectra (see Figure [6j. 
For NTT we took flat fields very close to our red grism 
observations, while for all other telescopes we only relied 
upon day-time lamp flats. However, while flat fielding does 
reduce the intensity of fringes, the best fringing reduction 
strategy for spectroscopy, i.e., observing triplets of spectra 
with the star in different positions along the slit, could not 
be applied at the chosen telescopes because it would have 
been too time consuming, observationally. 


The solution we adopted was to apply specific spec¬ 
tra processing steps to reduce the impact of fringing, 
adapting the procedure used in the mkfringeflatco E3 STIS 
(Space Telescope Ima ging Spectrograph) task for IRAF 
( Malumuth et al.l 2003 ). Briefly, the most appropriate flat 
field available for each scientific spectrum was extracted 
from a region covering exactly the same pixels covered by 
the scientific spectrum, collapsed to a ID spectrum along 
the dispersion direction, and normalized to one. The region 
that does not contain a significant fringing pattern was then 
flattened exactly to 1, to avoid adding extra noise to the sci¬ 
entific spectra. Finally, this specially prepared ID spectro¬ 
scopic flat was aligned to the scientific spectra and scaled 


10 http://stsdas.stsci.edu/cgi-bin/gethelp.cgi7mkfringef 
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Fig. 7 Example of long-term behaviour of the average 
counts (with their sigma) of REM monthly master dark 
frames with t eX p=60 s, covering roughly 3 years. A seasonal 
trend is clearly visible. 


until the residuals of the fringing pattern were minimized, 
using the IRAF telluric task, and applied to scientific spec¬ 
tra to correct them for fringing. 

We found that even using spectroscopic flat fields ob¬ 
tained in the day-time (thus with different incident light pat¬ 
terns), the procedure greatly helps in reducing the amplitude 
of fringes, up to a factor of 2-3 in relative intensity (see Fig¬ 
ure [6|. Sometimes, the closeness in position of the day-time 
flats fringe patterns to the SPSS fringe pattern produced 
beatings in the residuals of the fringing correction, with re¬ 
gions were the pattern was almost completely erased and re¬ 
gions where a residual fringing pattern remained. This has 
to be taken into account for applications of the method to 
scientific cases that could be sensitive to this kind of resid¬ 
ual effects. 

Additionally, in our spectroscopic campaign we often 
have observations of the same SPSS from different tele¬ 
scope and observing nights, and thus spectroscopic fringing 
residuals can be further reduced by building a median of the 
different spectra. The very red end of spectra (A A9500 A), 
where fringing is strong and S/N is low, will also be replaced 
with model or semi-emp irical template spectra, as done for 
example by Bohlin ( 2007 ). 


4.3 Dark frames 

Dark correction is the subtraction of the electron counts 
which accumulate in each pixel due to thermal noise. The 
reduction of dark currents is the main reason why all astro¬ 
nomical CCDs are cooled to liquid nitrogen temperatures. 
We took dark exposures as long as the longest scientific 
exposure, at least once per year. As expected, all used in¬ 
struments showed at most a few ADU per pixel per hour, 
except for the REM (Rapid Eye Movement) robotic tele¬ 
scope in La Silla, which is cooled by a Peltier system. The 
REM staff creates monthly master dark frames, containing 


15 

10 
S' 

Q 
oo 

< 5 

0 

-5 

Fig. 8 Example of long-term behaviour of the average 
counts (with their sigma) of San Pedro Martir daily mas¬ 
ter bias frames, covering more than three years. The CCD 
changes are marked and correspond to clear jumps in ADU 
and also in variations of the ADU stability with time. 



the bias structure as well (see below), thus we applied the 
closest master dark frame in time, taken with the correct 
t exp . A seasonal effect on the dark frames average level can 
be clearly seen in Figure[7] while the 2D shape of the frames 
was extremely stable and the ADU increased linearly with 

Cxp- 

4.4 Overscan and bias frames 

The bias current is an offset, preset electronically, to ensure 
that the Analogue-to-Digital Converter (ADC) always re¬ 
ceives a positive value and operates in a linear regime, as 
much as possible. The offset for each exposure given by the 
bias level has to be subtracted before further reduction, and 
may be modelled as A + B(xi,yi), where B is the pixel- 
to-pixel variation or 2D structure, which is time-invariant 
unless particular problems occur, while A is the overall bias 
level, that can change slightly during the night or even sea¬ 
sonally, owing to temperature changes. 

Generally A can be measured using a strip of pix¬ 
els, called overscan, acquired by continuing to readout the 
CCD beyond its real physical extent: the result is an over¬ 
sized array with a strip of signal-free pixels. In our survey, 
the overscan was available only for DOLORES @TNG and 
LaRuca@1.5m in San Pedro Martir, thus in all other cases 
we used the 2D bias frames to implicitly correct for A as 
well. This produces some additional uncertainty if A varies 
during the nighQ because the bias calibration frames were 
generally acquired in the afternoon (and also in the morning 
for the less stable instruments, like LaRuca). The different 
stability of the four CCDs used with LaRuca is illustrated 

11 However, we found that in both the photometry and long-slit spec¬ 
troscopy cases, the standard sky subtraction procedures we adopted were 
sufficiently accurate to get rid of any residual bias offset, at least within the 
required 1% level. 
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Table 5 Calibration plan recommendations following our calibration frames monitoring (see text). For each type of 
calibration frame, the minimum acceptable stability timescale is reported. 


Instrument 

Dark 

Photometric 

Bias 

Spectroscopic 

Bias 

Photometric 

Dome Flat 

Photometric 
Sky Flat 

Spectroscopic 
Lamp flat 

Spectroscopic 
Sky flat 

BFOSC @Cassini 

— 

4 days 

4 days 

300 days 

varies 0 

1 day 

1 week 

EFOSC2@NTT 

— 

5 days 

1 day 

250 days 

250 days 

1 day 

1 week 

DOLORES@TNG 

— 

1 day 

1 day 

— 

250 days 

5 days 

1 week 

CAFOS@2.2m 

— 

1 week 

1 week 

varies 6 

1 day 

1 day 

1 week 

LaRuca@ 1.5m 

— 

varies 11 

— 

— 

5 days 

— 

— 

ROSS@REM d 

1 month 

— 

— 

— 

— 

— 

— 


“Varies from 1 to 7 days, depending on the CCD used. 

6 Was 1 day until Summer 2008, and 1 week afterwards. 

“Was 4 days until 2010 February, and 100 days afterwards. 

d REM flat fields were not applied, to avoid reinforcing the strong light-concentration effect from which ROSS suffers. 



CCD DOMEFLATS 

CHANGED 

• • 

• 

• . • • . . • ; 

: ; 

CCD SKYFLATS 

CHANGED 

• 

1 


-500 0 500 1000 1500 

elapsed nights from 17 Jul 2008 


Fig. 9 Example of long-term behaviour of the normalized 
shape variation (A S/S, see text) of BFOSC nightly dome 
(top panel) and sky (bottom panel) flat fields in B filter. The 
shape variations are rarely above 1%, globally, showing an 
impressive stability over a period covering more than 2000 
nights. The CCD change in Summer 2008 is marked by a 
vertical dotted line. 


in Figure [8] where the bias level — after overscan subtrac¬ 
tion 12 !— of all the master bias frames is plotted. 

4.5 Flat fields 

After bias subtraction and — if needed — dark correction, 
data values are directly related to the number of photons de¬ 
tected in each CCD pixel. But different pixels can be char¬ 
acterized by a different sensitivity: the flat fielding correc¬ 
tion accounts for the non-uniform CCD response to incident 
light, that can show variations on all scales, from the whole 
CCD to the single pixel. In imaging, fiat field frames are 
acquired using the twilight sky or a screen uniformly illu- 

12 It can happen that after overscan correction the average level of a bias 
frame remains above zero by a few ADUs, because of the way the CCD is 
read out. 


minated by a lamp, with each filter; they are used to correct 
for sensitivit y variations on all scales, a cting as a sort of sen¬ 
sitivity map. Massey & Hanson ( 20131) recommend the use 


of dome or screen fiat fields to correct for the small-scale 
sensitivity variations and of sky flat fields for large-scale 
variations. In spectroscopy, only the small scale variations 
are corrected with flat fields obtained through the spectro¬ 
graph, i.e., with a slit and a grism (or grating). Of course, 
flat field frames with low signal or saturated were always 
rejected (see Section^- 

For imaging flat fields, we were interested in monitoring 
the large scale shape variations with time. We thus took all 
the masterframes obtained for each instrument, normalized 
them by their mode, and smoothed them with a boxcar to 
remove small scale variations. We then computed a stabil¬ 
ity function K sta b, using the pixel-to-pixel difference of the 
counts in each masterflat (F p \ x ) with respect to a reference 
masterflat 


Ajstab — 


pix =1 


(F ■ - F r ? f ) 2 

t-fpix 1 pi xl 


(1) 


We can then define a normalized shape variation as 

AS 1 / K stah 
S V ^tab(l%) 


( 2 ) 


where K s t a b(l%) is computed in the case where the pixel- 
to-pixel variation is equal to 1% everywhere. Using this in¬ 
dicator, we have a significant variation (i.e., a global varia¬ 
tion above 1%) when A S/S > 1. 

Using the K sta b global indicator, we explored all the 
master flat fields obtained during our SPSS survey with au¬ 
tomatic procedures (see Figure[9]for an example), thus iden¬ 
tifying the flat fields that needed to be visually inspected for 
anomalies, before applying them to the scientific images. 


4.5.1 Illumination variations 


For imaging, dome and sky flat fields are_sufficient to 


correct for the CCD response variations ( Mass ey 1997 
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Massey & Hansonll2013 ). Instrumental effects, like internal 
light reflectionsor light concent ratio n, need instead ad hoc 
proce dures (IKoch et al.l l2004i iManfroid. Selman. & Jones 


200 1). Among the imagers used in the SPSS campaigns. 


only ROSS@REM showed a significant (up to 5-10%, de¬ 
pending on the amount of incident light) cushion-shaped il¬ 
lumination variation, that was worsened by the application 
of the flat-field correction, similarly to what happens with 
light concentration. Because ROSS@REM was only used 
for relative photometry, we decided not to apply any flat- 
field correction to REM images and to keep the SPSS as 
much as possible in the same X and Y position in the CCD 
during each time series. Nevertheless, relative photometry 
with ROSS @REM proved to be extremely difficult and only 
rarely met our requirement of —mi Hi mag accuracy. 

For spectroscopy, illumination variations caused by im¬ 
perfections on the slit borders, vignetting, and other sim¬ 
ilar effects, are usually correcte d with the help of twi¬ 
light sky spectroscopic fl at fields ( Massev & Hansonll2013l 


Schonebeck et ah 120141) . applied after normal (i.e., lamp) 


flat fielding. After some initial testing, we decided to ob¬ 
tain spectroscopic flats fields once per run, as a compromise 
between the stability of the results and the time needed to 
obtain the flats (see Table|5|. 


5 Lamp flexures 

Low resolution spectrographs are usually mounted at the 
telescope or at the telescope derotator, and they move dur¬ 
ing observations. At each different position, the varying pro¬ 
jection of the gravitational force leads to mechanical dis¬ 
tortions, which can be seen on the wavelength calibration 
frames, where they pr oduce linear or non-linear shifts of the 
lamp emission lines ( Munari & Lattanzil 1992h . A correct 
flux table must associate the right flux to the right wave¬ 
length, thus any error in wavelength has an impact on the 
flux calibration of a spectra n{3 

We performed a test to evaluate the DOLORES @TNG 
lamp flexures on 2008 January 31, when the instrument was 
equipped with three separate calibration lamps (He, Ne, Ar) 
that could not be switched on simultaneously, and the pro¬ 
cedure to obtain high S/N calibration lamps for each sin¬ 
gle star during night-time observations was too time con¬ 
suming. We used the LR-R grism, the 2” slit and the Ar 
lamp. Triplets of wavelength calibration lamp spectra were 
acquired at different positions of the derotator, covering a 
complete derotator circle, from -260 to + 100 degrees in 
steps of 10 degrees forward, and from +95 up to -255 de¬ 
grees in steps of 10 degrees backwards. The median of the 
three acquired lamp frames at each position was used to 
evaluate line shifts. 


13 It is important to remark that — especially in those parts of the spec- 
tram where the flux changes rapidly with wavelength — even small er¬ 
rors in the wavelength calibration can imply relatively large errors on the 
flux calibration. This problem becomes difficult to solve, i.e., by means of 
cross-correlation to align spectra, for featureless or nearly featureless stars, 
which are in general the best flux calibrators. 



Fig. 10 Example of lamp flexure effect on emission lines 
position of a TNG Argon wavelength calibration lamp ex¬ 
posure with the LR-R grism (see text). Each panel repre¬ 
sents the line peak shift in pixels (ordinate) of six individ¬ 
ual argon lines as a function of DOLORES derotator angle 
position (abscissa). Each green dot corresponds to a peak 
measurement in a different lamp exposure. The typical shift 
of almost ±1.5 pixels corresponds to almost 8 A. The thin 
black line is a polynomial fit to the data, to guide the eye. 


The resulting shifts of emission lines, as a function 
of derotator angle, are characterized by a quasi-sinusoidal 
trend, as shown in Figure[l0]for a selection of the examined 
lines. The typical size of the global oscillation was almost 
three pixels, with an error on the lines peak position rang¬ 
ing from 0.01 pixels in the central portion of the CCD to 
0.06 pixels at the CCD borders. The shift amounts to almost 
8 A in the space of wavelengths and thus daytime lamps 
are clearly not appropriate for the wavelength calibration 
of spectra in the Gaia spectrophotometric campaigns. For 
this reason, most astronomers also take wavelength calibra¬ 
tion lamps close to the position of each observed spectrum. 
Not being able to measure emission line shifts for all instru¬ 
ments, we also adopted the same strategy. In particular, if 
the night lamp observations required too much time — as 
was initially the case with TNG — we decided to employ 
day-time lamps with very high S/N, and shift them to match 
the night lamps taken with lower S/N in order to save time. 
In all other cases we used night-time lamps. 


6 Polarization 


The reflectivity of curved and plane mirrors may be diffe¬ 
rent for different directions of linear pol arizatio n, in awave- 
length dependent way (see Breckinridge & Oppenheimer 
20041 and references therein). An incoming beam of non¬ 
polarized light will be polarized according to this differ¬ 
ence of efficiency. This might affect the sensitivity of the 
whole observing system, especially if the light beam en¬ 
counters other polarization-sensitive optical elements after 
reflection. The degree of induced polarization depends on 
the angle of incidence: the maximum degree of (linear) po- 
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larization occurs for an incidence angle which depends on 
the material, called polarization angle. Refractive dispers¬ 
ing elements (grisms, in our case) may also — in princi¬ 
ple — produce some degree of polarization, or may have 
different transmission efficiencies for different intrinsic po¬ 
larization of the incoming light, thus introducing selective 
light losses. A combination of optical elements with differ¬ 
ent orientation could thus produce a significant variation in 
the efficiency, dependent on wavelength and/or pointing. 

We can envisage three cases relevant to our SPSS cam¬ 
paigns: 

1. The light beam encounters just one optical element po¬ 
tentially sensitive to polarization, for example the grism, 
as in the case of CAFOS@2.2m in Calar Alto. If the in¬ 
coming light is polarized, there may be a difference in 
the response depending on the actual orientation of the 
grism in the plane perpendicular to the direction of the 
light beam. If the incoming light is not polarized, there 
is no effect. 

2. There is more than one element potentially sensitive to 
polarization, but their relative orientation in the planes 
perpendicular to the direction of the incoming beam are 
fixed, as is the case of the flat mirror feeding the grisms 
of BFOSC@Cassini. The netresult is the same as above. 

3. There is more than one element potentially sensitive to 
polarization, and their relative orientation in the planes 
perpendicular to the direction of the incoming beam 
changes, depending on the telescope pointing. This is 
the case of the flat M3 mirror of the twin TNG and NTT 
telescope^ that drives the main light path to the Nas¬ 
myth focus where it feeds the grism of the DOLORES 
and EFOSC2 spectrographs, mounted on the derotator 
device. In this case some polarization effect can be in¬ 
duced even on non-polarized incoming light, and the 
grism may respond slightly differently — in terms of 
efficiency — to differently polarized light, resulting in 
a light loss depending on the relative orientation of the 
two elements (see Giro et al1l2003l) . 


6.1 The TNG polarization experiment 

While potentially highly polarized sources should be 
avoided as candidate SPSS (as for instance DP white dwarfs 
or heavily extincted stars), there is no observational strategy 
that can completely avoid the problem. We thus tested the 
above hypothesis on TNG, because it has the same NTT 
design, and they are the only two telescopes in our cam¬ 
paign where polarization could significantly affect spectro- 
photometric observations. 

dl99(j) : 


We observed two stars from Turnshek et al 


HD 155197, which exhibits a polarization in V band of 
4.38±0.03 % — that we will call polarized star here¬ 
after — and HD 154892, having a B-band polarization of 


0.05±0.03 % — that we will call non-polarized star. They 
were observed on a clear night (2012 June 1, kindly ob¬ 
tained by the TNG staff), with good seeing (0.8”-1.5”), 
using a wide slit (10”) and the same grisms used for the 
SPSS campaign (LR-R and LR-B, see Table |4]). Spectra 
were taken at different derotator angles of -160, -108, - 
56, and -5 degrees for the polarized star, and 170, 223, 
274, and 325 degrees for the non-polarized star. The spectra 
were bias and flat-field corrected, extracted, and calibrated 
in wavelength with the help of narrow slit (2”) spectra and 
day-time calibration lamps, using standard recipes from the 
IRAF apall and onedspec packa ges, co nfigured exactly as 
done for our SPSS reductions ( Cocozza et al .1120131) . We fi¬ 
nally corrected all spectra for atmospheric extinction, using 
an extinction curve built from our own data, even if the air- 
mass difference among the used spectra was small: 0.10 for 
the polarized star and 0.05 for the non-polarized star. 

As can be seen in Figure QT| the overall shapes of the 
spectra taken at different derotator angles agree with one 
another within 1%, except at the blue edge (below 4000 A), 
where the S/N is rather low, and above 8000 A, as expected 
because of fringing and telluric absorption (which were not 
corrected for in this particular test). This confirms first of all 
that the night conditions were photometric. The additional 
absence of significant slopes or oscillations confirms that 
both the stellar intrinsic polarization and the polarization 
supposedly induced by the instrument configuration had no 
significant impact on our flux measurements (i.e., not more 
than 1 %), even for different stellar polarization levels, wave¬ 
length regions, or derotator positions. 

7 Second-order contamination 

Low resolution spectrographs and large format CCDs al¬ 
low for observations of a wide wavelength range, as needed 
for the Gaia SPSS grid. However, with both grisms and 
gratings the light from different orders can overlap, espe- 
cially_if no cr oss disperser or blocking filter is employed 
(G utier rez-,Moreno et al. 19941) . Typically, light from the 
blue wavelengths of the second order can contaminate the 
red portion of the spectra, in the spectrographs used in our 
campaign. The characteristics of the contaminating light de¬ 
pend on both the spectral energy distribution of the observed 
source and the characteristics of the dispersing element. In 
particul ar, it is necessary to k now the grism dTraubl 199Gh or 
grating ( Szokolv et al .12004 equation to apply a correction. 

Two of our adopted instrument configurations, for 
EFOSC2@NTT and DOLORES @TNG, are known to 
possess significant second order contamination. For 
CAFOS@2.2m in Calar Alto the only visible second order 
light started well after 10000 A, after the end of the first- 
order spectrum in the rec@. For BFOSC@Cassini it was 


14 The EFOSC2@NTT web site (http://www.eso.org/sci/facili 
ties/lasilla/instruments/efosc-3p6/) reports: “ Preliminary analysis of 
instrument polarization shows <0.1% at field center, and ~ 0.4% at the 
edge (07/07/2005)”. 


15 The manual mentions a second order contamination for the B200 
grism starting after 6 500 A, but our data reach up to ~8 500 A and no 
contaminating light was detected; for R200, the manual mentions a second- 
order contamination stalling at 10000 A, but we could clearly see the sec- 


www.an-journal.org 


© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim 





























12 


G. Altavilla et al.: Instrumental effects of six ground-based observing campaigns 


HD154B92 non polarized 
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Fig. 11 Results of our test on the effect of polarization on spectrophotometry. The left panels report observations of the 
“non-polarized” star HD 154892. while the right panels of the “polarized” star HD 155197 (see text for more details). The 
three top panels for each star refer to the LR-B grism, while the bottom panels to the LR-R grism. Each sub-panel shows the 
ratio of spectra taken at different derotator angles with respect to a reference angle, which is 170 deg for the non-polarized 
star and -160 deg for the polarized star. All ratios are shown as a function of wavelength. Dashed lines in each sub-panel 
indicate ±1% ratio variations. Except for the noisy blue edge, and the red parts were fringing is important, it is clear that 
the effect is negligible for our purposes. The increase in the fringing residuals pattern for the polarized star is caused by the 
fact that the polarized star was accidentally placed in different positions along the slit for different polarization angles. 


absent, both judging from the BFOSC manuals and web 
pages, and from the quality of our final spectra of CAL- 
SPEC0 stars. The chosen setups were unavoidable owing 
to our requirements to cover as much as possible the Gaia 
wavelength range (300-1100 nm) and to oversample the re¬ 
solution of the BP and RP (Blue and Red Photometers) by a 
factor of up to 4-5, whenever possible. 

W e th us emp loyed an_adaptation of the method by 
Sanchez-Blazquez et al. ( 2006i) to correct our spectra from 
second-order contamination. The original method requires 
observations of a blue and a red star, with well known cal¬ 
ibrated flux in the literature. They are used to build a re¬ 
sponse curve for the contaminating second-order spectra, 
by solving the following system of equations: 


S r — C\T r + C 2 T 2r 


(4) 


where the b subscript refers to the blue star and the r one 
to the red star. S is the observed spectrum, including the 
second-order contamination, while T is the true spectrum 
obtained from the literature (thus free from second-order 
contamination), and Ti is the second-order contaminating 
component. C\ is the response curve for the first order light 
and C 2 for the second order light: once they are known, 
any spectrum observed with the same instrument can be 
corrected for sec ond-order contamination as described by 
Sanchez-Blazquez et al. ( 20061 ). Since C\ and C 2 were not 
computed each night in our SPSS campaign, we derived the 
corrected spectrum (S' h for the blue star case) using 


S b = C{T b + C 2 T 2b 


(3) 


S'b = S b - C 2 S' 2b 


(5) 


ond order spectrum, starting well after the end of the first order spectrum, 
with no overlap between the two orders. 

16 http://www.stsci.edu/hst/observatory/crds/calspec.html 


where S' 2b is the ratio S b /C\, i.e., the second order contam¬ 
inating spectrum, after resampling and shifting, according 
to the function mapping the first order into the second order 
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Fig. 12 Example of our second order contamination cor¬ 
rection for a CALSPEC star (GD 71, one of the pillars) ob¬ 
served with NTT. The red (dotted) spectrum is our observed 
spectrum contaminated by second-order light coming from 
blue wavelengths. The black spectrum is the same spectrum 
after our second order correction procedure; it still contains 
telluric absorption bands and fringing. The green spectrum 
is the CALSPEC spectrum for the same star: the spectral 
shape is recovered to the 1% level. 


wavelength range (see following sections for more details), 
and still not calibrated in flux. 

The method by Sanchez-Blazquez et ali ( 2006 ) requires 
a single spectrum to cover the whole wavelength range, 
while in our case we always had two setups — with some 
overlap — observed in two slightly different moments of 
time. Our adaptation consisted in joining the blue and red 
spectra of the observed stars, after reporting the blue spec¬ 
trum to the airmass of the red one with a good extinction 
curve. A residual jump in the junction point does not have 
any effect on the quality of the final correction for second 
order. 

While the original method by Sanchez-Blazquez et al.l 


(20061) for deriving C\ and C 2 can be applied directly, 
even in non-perfectly photometric conditions, our particular 
adaptation requires some care in those cases where extinc¬ 
tion vatiations are large. Application of the method is safe in 
photometric and grcyjj nights, where extinction variations 
are lower than ~3%, roughly speaking. 


7.1 NTT second order correction 


The spectra of two well known flux standards, Feige 110 
(a blue star, Landolt & Uomotoil2007l) and LTT 1020 (a red 


star, Landolt 1992). where acquired with the blue and red 
EFOSC2 grisms #11 and #16 (see Table |4|, used for the 


whole SPSS campaign. Each star was observed with a 10” 
slit to gather all the flux, and with a 2” slit to obtain a reliable 
wavelength calibration. 

The spectra were extracted and wavelength calibrated 
as described in Section I6TTI We created an extinction curve 
from our own data, that compared well with the CTIO and 
ESO ones. We then joined the grism #11 and #16, reported 
to the same airmass, into a single spectrum, cutting them at 
6000 A, i.e., well before the expected start of second order 
contamination. We used the absorption lines of the blue con¬ 
taminating spectrum — well visible at red wavelengths — 
to determine the function that maps the blue contaminating 
wavelengths in the red wavelength domain: 


A 2 =-3815 + 3.271 Ai- 0.0001167 A? (6) 


where Ai is the wavelength in A of the first order, and 
A 2 the corresponding wavelength of the second-order con¬ 
taminating blue light. The residuals around this fit were of 
2.9 A. The flux reference tables used for computing the re¬ 
sponse curves, i.e., Tb and T, in the equations of the pre¬ 
vious section, were obtained for LTT 1020 from CTIC0 
( Hamuv et al.l 19921 1994) and for Feige 1K) from the CAL¬ 
SPEC database ( Bohlin. Dickinson & Cal zettill 2001 1) . 


Once the two response curves C\ and C 2 were derived, 
we computed the percentage of the blue light of a star that 
is expected to fall at each red wavelength, as 


P = 


9k 

c\ 


(7) 


where C 2 is C 2 reported into blue wavelengths using the 
relation above. The computation of P was repeated in two 
different epochs (using LTT 1020 and Feige 110 on 2008 
November 30, and LTT 9239 and Feige 110 on 28 August 
2011) and appeared stable within 1%. The contamination 
starts around 6 000A, as shown in Figure [121 and becomes 
severe around 6 500A. 


7.2 TNG second order correction 


The DOLORES red LR-R grism used in our SPSS cam¬ 
paign had a built-in order-blocking filter for wavelengths 
bluer than ~5 000 A, but unfortunately some second- 
order contamination from blue light was observed af¬ 
ter ~9500 /^l To obtain the percentual contamination, 
P, we used two well known standards: HZ 44 (a blue 
star, Landolt & Uomoto 20071) and G 146-76 (a red star. 


H0g el al. 200(1 A third star , GD 153 (another blue star, 
Bohlin. Colina & Finlev 19951) was observed to indepen¬ 
dently test the results. We also observed all LR-R spec¬ 
tra together with the Johnson I broadband filter in front of 


17 Grey conditions are attained when the cloud coverage produced grey 
extinction variations, i.e., when extinction does not alter significantly the 
spectral shape (within 1%). This condition is almost always verified in the 
case of veils or thin clouds dOkdll990l ; iPakstiene & Solheimll2003l) , and 
can easily be checked a posteriori. 


18 ftp://ftp.eso.org/pub/stecf/standards/ctiostan 

19 We have also observed an effect that could be ascribed to second order 
contamination in the blue LR-B grism, starting at 6 000 A approximately. 
Because that region is always adequately covered by the red LR-R grism, 
we simply cut the contaminated section of the LR-B spectra. 
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Fig. 13 Same as Figure[l2l but for one TNG spectrum of 
GD 153, taken in 2009 June. Apart from the telluric bands 
and the strong fringing that was not corrected for at this 
stage, the second order contamination — starting around 
9000-9500 A — is very well corrected even with our simple 
assumptions on the grism equation (see text). 


the spectrograph, to recover independently the uncontam¬ 
inated spectral shape. They were observed with the LR- 
B and LR-R grisms (see Table [l} both with a 5” slit, the 
widest available at the time, and with the 2” slit for a better 
wavelength calibration. We also tested GD 153 and KF06T2 
( Reach et al. 2005) on 2009 June 8 with similar results: the 
maximum difference between the two P curves was always 
lower than ~0.3%. All spectra were extracted and wave¬ 
length calibrated as in the NTT case, and reported to the 
same airmass with a tabulated extinction curviFl We joined 
the spectra arbitrarily at 5850 A, i.e., well before the start of 
second-order contamination. 

To determine a wavelength mapping relation, we ob¬ 
tained helium lamp spectra, with 1 h t exp , with a B filter 
to block first order light. Of the three measureable emission 
lines in LR-R, one was at 9553 A, where no He line is ex¬ 
pected. We thus could not derive a good fit of the function 
mapping the second order, as done for the NTT case. We 
used the simple formula Ai = 2 A 2 , that derives from the 
simplest form of the grating equation 


m\ = cr(sin j3 ± sin a) 


( 8 ) 


We then derived the Ci and C 2 curves and the percentage of 
contaminating flux using this basic assumption: the emis¬ 
sion line appearing at 9553 A corresponded well with the 
4713 A He emission line. 

We tested the correction on TNG observations of var¬ 
ious CALSPEC and literature standards (one example is 
reported in Figure fT3l ). We obtained good results, having 
differences with SPSS literature spectra of 1-2% at most, 
for several stars. However, the presence of fringing and a 

20 http://www.ing.iac.es/astronomy/observing/manuals/ps/tech_notes/ 
tn031 .pdf 


deep telluric band in the contaminated region (after 9500 A) 
causes high residuals, which can be ^10% in the worst 
cases. For many SPSS we had observations from other tele¬ 
scopes, but for the few SPSS observed only with TNG, a vi¬ 
able solution will be to replace the flu x ta bles after 9500 A 
with model spectra (see also Bohlinll2007h . 


8 Conclusions 


To build a grid of ~ 100 SPSS with <1% internal errors and 
< 1 -3% ex ternal errors (with respect to the Vega calibration 
by Bohlin 20071) we have carried out a systematic study of 
instrumental effects that could have an impact on the flux 
calibration of SPSS flux tables and integrated magnitudes, 
finding methods to beat each effect separately down, until 
their impact on fluxes (or ADUs) was below 1%. We col¬ 
lected from the literature the available methods and strate¬ 
gies to both evaluate and remove those instrumental effects, 
and carried out specific daytime and nighttime observations 
to quantify their effects. When necessary, we adapted the 
methods to our case, or developed our specific tests and 
data-reduction methods. Whenever feasible — given the 
difficulties to obtain telescope time for such programs — 
we compared different approaches to the same problem. In 
particular: 

- we characterized the employed CCDs to evaluate their 
minimum acceptable exposure time, the shutter delay, 
and the linearity limit, using different methods, and de¬ 
riving recommendations for our SPSS observing cam¬ 
paigns (see Tables[2]and[3]i: 

- we carried out a calibration frames monitoring plan, that 
provided recommendations on the minimum frequency 
of each calibration type observations (see Table 0 for 
our observations; 

- as part of our calibration frames monitoring, we devised 
a specific procedure to correct our spectra for fringing, 
after manipulation of spectroscopic flat fields (follow¬ 
ing the strategy bv lMalumuth et al.ll2003l) and using the 
IRAF telluric task; 

- we have also quantitatively studied the effect of gravity- 
induced lamp flexures at DOLORES @TNG; we found 
shifts of about 3 pixels, with 0.01-0.06 errors, corre¬ 
sponding to almost 8 A in the wavelength space, con¬ 
firming that for this type of instruments day-time lamps 
are not sufficiently accurate. If night-time lamps re¬ 
quired too much observing time, we acquired high S/N 
day-time lamps and shifted them onto the night-time 
ones to perform the wavelength calibration; 

- we studied the effect of instrument-induced polariza¬ 
tion with both NTT and TNG, the two instruments that 
should have the largest polarization in our SPSS cam¬ 
paigns. We observed stars with polarization levels up 
to 4% at different instrument rotation angles, and con¬ 
cluded that even in the case of SPSS with those intrinsic 
polarization levels, the flux calibration remained stable 
within 1%; 
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- we tested and adapted to ou r case the method described 
by Sanchez-Blazauez et al. ( 20061) to correct our spec¬ 
tra for second order contamination; our method can be 
applied in photometric and grey nights with residuals 
generally within 1-2% for NTT and TNG. 


In summary, a few of the examined instrumental effects 
turned out to be negligible, while we devised specific meth¬ 
ods to correct for the remaining ones, reducing their impact 
on the flux calibration of SPSS magnitudes and spectra to 
<1%. The methods can be applied to a wide range of obser¬ 
vational programs on similar instruments. 

The actual reduction and analysis of the SPSS spectra 
will be presented in a subsequent paper, where we will fur¬ 
ther reduce the residuals of instrumental effect corrections 
by combining several independent observations for each 
SPSS. We will also replace the low S/N blue and red borders 
(roughly <3800-4000 A and >9500 A) with theoretical or 
semiempirical templates. 
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